Overview

Dataset Statistics

Number of Variables 17
Number of Rows 45211
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 29.2 MB
Average Row Size in Memory 677.2 B
Variable Types
  • Numerical: 7
  • Categorical: 10
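
The average row size can be cross-checked against the two totals above. A minimal sketch, assuming the report's "MB" means 2**20 bytes (the report does not define the unit; the variable names are illustrative):

```python
# Values copied from the Dataset Statistics block above.
TOTAL_SIZE_MB = 29.2
N_ROWS = 45211
N_VARIABLES = 17

# Assumption: "MB" is 2**20 bytes. The report does not state this.
BYTES_PER_MB = 2 ** 20

avg_row_bytes = TOTAL_SIZE_MB * BYTES_PER_MB / N_ROWS
total_cells = N_ROWS * N_VARIABLES

print(f"average row size: {avg_row_bytes:.1f} B")
print(f"total cells: {total_cells}")
```

Under that assumption the result rounds to the reported 677.2 B; reading "MB" as 10**6 bytes would not reproduce it.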

Dataset Insights

balance is skewed
duration is skewed
campaign is skewed
pdays is skewed
previous is skewed
month has constant length 3
balance has 3766 (8.33%) negatives
pdays has 36954 (81.74%) negatives
balance has 3514 (7.77%) zeros
previous has 36954 (81.74%) zeros
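
Each percentage in these insights is the count divided by the 45211 rows. A small sketch that reproduces them (the helper name `share` is hypothetical):

```python
N_ROWS = 45211  # Number of Rows from Dataset Statistics

def share(count, total=N_ROWS):
    """Return count as a percentage of total, rounded to two decimals."""
    return round(100 * count / total, 2)

# Counts copied from the insights above.
insights = {
    "balance negatives": 3766,
    "pdays negatives": 36954,
    "balance zeros": 3514,
    "previous zeros": 36954,
}
for name, count in insights.items():
    print(f"{name}: {count} ({share(count)}%)")
```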

Variables


age

numerical

Approximate Distinct Count 77
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 723376 B
Mean 40.9362
Minimum 18
Maximum 95
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • age is skewed right (γ1 = 0.6848)

Quantile Statistics

Minimum 18
5-th Percentile 27
Q1 33
Median 39
Q3 48
95-th Percentile 59
Maximum 95
Range 77
IQR 15
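
Range and IQR follow directly from the quantiles listed above. The sketch below also derives outlier fences with the conventional 1.5 × IQR rule; that the report counts outliers this way is an assumption, not something it states, and `tukey_fences` is a hypothetical helper name.

```python
# Quantiles copied from the age block above.
minimum, q1, q3, maximum = 18, 33, 48, 95

value_range = maximum - minimum  # matches the reported Range
iqr = q3 - q1                    # matches the reported IQR

def tukey_fences(q1, q3, k=1.5):
    """Return (lower, upper) fences; values outside them count as outliers.

    Assumption: k=1.5 is the conventional multiplier, not a setting
    confirmed by the report.
    """
    spread = q3 - q1
    return q1 - k * spread, q3 + k * spread

low, high = tukey_fences(q1, q3)
print(value_range, iqr, low, high)
```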

Descriptive Statistics

Mean 40.9362
Standard Deviation 10.6188
Variance 112.7581
Sum 1.8508e+06
Skewness 0.6848
Kurtosis 0.3194
Coefficient of Variation 0.2594
  • age is not normally distributed (p-value 3.4309e-03)
  • age has 487 outliers
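
Skewness (γ1) and Kurtosis are standardized central moments. The negative Kurtosis reported for day indicates the report uses excess kurtosis, where a normal distribution scores 0. A minimal sketch using the population (uncorrected) form; whether the report applies a bias correction is not stated, so treat the estimator choice as an assumption. The sample data are made up.

```python
def central_moment(values, order):
    """Mean of (x - mean)**order over all values."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** order for v in values) / len(values)

def skewness(values):
    """gamma_1 = m3 / m2**1.5 (population form, no bias correction)."""
    return central_moment(values, 3) / central_moment(values, 2) ** 1.5

def excess_kurtosis(values):
    """m4 / m2**2 - 3, so a normal distribution scores 0."""
    return central_moment(values, 4) / central_moment(values, 2) ** 2 - 3

# Hypothetical toy data with one large value pulling the right tail.
sample = [1, 2, 3, 4, 10]
print(skewness(sample), excess_kurtosis(sample))
```

A positive γ1, as for age, means the right tail is longer than the left.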

job

categorical

Approximate Distinct Count 12
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3367566 B

Length

Mean 9.4855
Standard Deviation 1.8211
Median 10
Minimum 6
Maximum 13

Sample

1st row management
2nd row technician
3rd row entrepreneur
4th row blue-collar
5th row unknown

Letter

Count 412369
Lowercase Letter 412369
Space Separator 0
Uppercase Letter 0
Dash Punctuation 11311
Decimal Number 0

marital

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3247609 B
  • The largest value (married) is over 2.13 times larger than the second largest value (single)

Length

Mean 6.8323
Standard Deviation 0.6082
Median 7
Minimum 6
Maximum 8

Sample

1st row married
2nd row single
3rd row married
4th row married
5th row single

Letter

Count 308894
Lowercase Letter 308894
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (married, single) take over 50.0%

education

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3314897 B
  • The largest value (secondary) is over 1.74 times larger than the second largest value (tertiary)

Length

Mean 8.3206
Standard Deviation 0.7766
Median 9
Minimum 7
Maximum 9

Sample

1st row tertiary
2nd row secondary
3rd row secondary
4th row unknown
5th row unknown

Letter

Count 376182
Lowercase Letter 376182
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (secondary, tertiary) take over 50.0%

default

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3029952 B
  • The largest value (no) is over 54.47 times larger than the second largest value (yes)

Length

Mean 2.018
Standard Deviation 0.133
Median 2
Minimum 2
Maximum 3

Sample

1st row no
2nd row no
3rd row no
4th row no
5th row no

Letter

Count 91237
Lowercase Letter 91237
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (no, yes) take over 50.0%
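
The category counts behind these insights are not printed, but for a two-value column they can be recovered from the Letter count: each no contributes two letters and each yes three. A sketch of that derivation, assuming no and yes are the only values (as the Length and Sample blocks suggest); the function name is hypothetical.

```python
N_ROWS = 45211  # Number of Rows from Dataset Statistics

def yes_no_counts(letter_count, n_rows=N_ROWS):
    """Recover (no, yes) counts from the total number of letters.

    Each 'no' has 2 letters and each 'yes' has 3, so
    letter_count = 2 * n_rows + yes.
    """
    yes = letter_count - 2 * n_rows
    return n_rows - yes, yes

no, yes = yes_no_counts(91237)  # Letter Count of default
ratio = no / yes
top2_share = 100 * (no + yes) / N_ROWS

print(no, yes, round(ratio, 2), top2_share)
```

The ratio rounds to the reported 54.47. The same arithmetic applied to the Letter counts of loan (97666) and y (95711) gives ratios that round to the reported 5.24 and 7.55. With only two categories, the "top 2 categories take over 50.0%" insight is trivially true, since the two together cover every row.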

balance

numerical

Approximate Distinct Count 7168
Approximate Unique (%) 15.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 723376 B
Mean 1362.2721
Minimum -8019
Maximum 102127
Zeros 3514
Zeros (%) 7.8%
Negatives 3766
Negatives (%) 8.3%
  • balance is skewed right (γ1 = 8.36)

Quantile Statistics

Minimum -8019
5-th Percentile -172
Q1 72
Median 448
Q3 1428
95-th Percentile 5768
Maximum 102127
Range 110146
IQR 1356

Descriptive Statistics

Mean 1362.2721
Standard Deviation 3044.7658
Variance 9.2706e+06
Sum 6.159e+07
Skewness 8.36
Kurtosis 140.7358
Coefficient of Variation 2.2351
  • balance is not normally distributed (p-value 4.0538e-22)
  • balance has 4729 outliers

housing

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3054267 B

Length

Mean 2.5558
Standard Deviation 0.4969
Median 3
Minimum 2
Maximum 3

Sample

1st row yes
2nd row yes
3rd row yes
4th row yes
5th row no

Letter

Count 115552
Lowercase Letter 115552
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (yes, no) take over 50.0%

loan

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3036381 B
  • The largest value (no) is over 5.24 times larger than the second largest value (yes)

Length

Mean 2.1602
Standard Deviation 0.3668
Median 2
Minimum 2
Maximum 3

Sample

1st row no
2nd row no
3rd row yes
4th row no
5th row no

Letter

Count 97666
Lowercase Letter 97666
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (no, yes) take over 50.0%

contact

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3290289 B
  • The largest value (cellular) is over 2.25 times larger than the second largest value (unknown)

Length

Mean 7.7763
Standard Deviation 0.5497
Median 8
Minimum 7
Maximum 9

Sample

1st row unknown
2nd row unknown
3rd row unknown
4th row unknown
5th row unknown

Letter

Count 351574
Lowercase Letter 351574
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (cellular, unknown) take over 50.0%

day

numerical

Approximate Distinct Count 31
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 723376 B
Mean 15.8064
Minimum 1
Maximum 31
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • day is only slightly skewed right (γ1 = 0.0931), close to symmetric

Quantile Statistics

Minimum 1
5-th Percentile 3
Q1 8
Median 16
Q3 21
95-th Percentile 29
Maximum 31
Range 30
IQR 13

Descriptive Statistics

Mean 15.8064
Standard Deviation 8.3225
Variance 69.2636
Sum 714624
Skewness 0.0931
Kurtosis -1.0599
Coefficient of Variation 0.5265
  • day is not normally distributed (p-value 5.1769e-06)

month

categorical

Approximate Distinct Count 12
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3074348 B
  • The largest value (may) is over 2.0 times larger than the second largest value (jul)

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row may
2nd row may
3rd row may
4th row may
5th row may

Letter

Count 135633
Lowercase Letter 135633
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • month has words of constant length

duration

numerical

Approximate Distinct Count 1573
Approximate Unique (%) 3.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 723376 B
Mean 258.1631
Minimum 0
Maximum 4918
Zeros 3
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • duration is skewed right (γ1 = 3.1442)

Quantile Statistics

Minimum 0
5-th Percentile 35
Q1 103
Median 180
Q3 319
95-th Percentile 751
Maximum 4918
Range 4918
IQR 216

Descriptive Statistics

Mean 258.1631
Standard Deviation 257.5278
Variance 66320.5741
Sum 1.1672e+07
Skewness 3.1442
Kurtosis 18.1518
Coefficient of Variation 0.9975
  • duration is not normally distributed (p-value 8.6925e-15)
  • duration has 3235 outliers

campaign

numerical

Approximate Distinct Count 48
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 723376 B
Mean 2.7638
Minimum 1
Maximum 63
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • campaign is skewed right (γ1 = 4.8985)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 1
Median 2
Q3 3
95-th Percentile 8
Maximum 63
Range 62
IQR 2

Descriptive Statistics

Mean 2.7638
Standard Deviation 3.098
Variance 9.5977
Sum 124956
Skewness 4.8985
Kurtosis 39.2452
Coefficient of Variation 1.1209
  • campaign is not normally distributed (p-value 5.9521e-24)
  • campaign has 3064 outliers

pdays

numerical

Approximate Distinct Count 559
Approximate Unique (%) 1.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 723376 B
Mean 40.1978
Minimum -1
Maximum 871
Zeros 0
Zeros (%) 0.0%
Negatives 36954
Negatives (%) 81.7%
  • pdays is skewed right (γ1 = 2.6156)

Quantile Statistics

Minimum -1
5-th Percentile -1
Q1 -1
Median -1
Q3 -1
95-th Percentile 317
Maximum 871
Range 872
IQR 0

Descriptive Statistics

Mean 40.1978
Standard Deviation 100.1287
Variance 10025.7658
Sum 1.8174e+06
Skewness 2.6156
Kurtosis 6.9343
Coefficient of Variation 2.4909
  • pdays is not normally distributed (p-value 4.8313e-25)
  • pdays has 8257 outliers
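
Three numbers in this report line up: pdays has 36954 negatives, previous has 36954 zeros, and 45211 - 36954 = 8257 is the outlier count reported for both variables. That pattern is consistent with -1 acting as a placeholder for rows with no earlier contact, although the report does not say so. With Q1 = Q3 = -1 the IQR is 0, so any IQR-based rule would flag every other value. A sketch, under that assumption, that sets the placeholder aside before summarizing; the helper name and the toy data are hypothetical.

```python
from statistics import median

# Assumption: -1 marks "no earlier contact"; the report does not confirm this.
SENTINEL = -1

def split_sentinel(values, sentinel=SENTINEL):
    """Return (has_value_flags, real_values) with sentinel rows set aside."""
    flags = [v != sentinel for v in values]
    real = [v for v in values if v != sentinel]
    return flags, real

# Toy data (made up), not rows from the profiled dataset.
pdays = [-1, -1, -1, 92, -1, 317, -1, 150]
flags, real = split_sentinel(pdays)
print(sum(flags), median(real))

# Consistency check using counts printed in the report.
rows, negatives, outliers = 45211, 36954, 8257
print(rows - negatives == outliers)
```

Keeping the flag as its own yes/no column and summarizing only the real values avoids statistics such as the mean of 40.1978 blending placeholder rows with actual day counts.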

previous

numerical

Approximate Distinct Count 41
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 723376 B
Mean 0.5803
Minimum 0
Maximum 275
Zeros 36954
Zeros (%) 81.7%
Negatives 0
Negatives (%) 0.0%
  • previous is skewed right (γ1 = 41.8451)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 3
Maximum 275
Range 275
IQR 0

Descriptive Statistics

Mean 0.5803
Standard Deviation 2.3034
Variance 5.3058
Sum 26237
Skewness 41.8451
Kurtosis 4506.3621
Coefficient of Variation 3.9692
  • previous is not normally distributed (p-value 4.3016e-25)
  • previous has 8257 outliers

poutcome

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3251512 B
  • The largest value (unknown) is over 7.54 times larger than the second largest value (failure)

Length

Mean 6.9186
Standard Deviation 0.3952
Median 7
Minimum 5
Maximum 7

Sample

1st row unknown
2nd row unknown
3rd row unknown
4th row unknown
5th row unknown

Letter

Count 312797
Lowercase Letter 312797
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (unknown, failure) take over 50.0%

y

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3034426 B
  • The largest value (no) is over 7.55 times larger than the second largest value (yes)

Length

Mean 2.117
Standard Deviation 0.3214
Median 2
Minimum 2
Maximum 3

Sample

1st row no
2nd row no
3rd row no
4th row no
5th row no

Letter

Count 95711
Lowercase Letter 95711
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (no, yes) take over 50.0%

Interactions

Correlations

Missing Values